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Abstract —Spatially coupled low-density parity-check (SC- 
LDPC) codes can achieve the channel capacity under low- 
complexity belief propagation (BP) decoding, however, there 
is a non-negligihle rate-loss because of termination effects for 
practical finite coupling lengths. In this paper, we study how we 
can approach the performance of terminated SC-LDPC codes 
by random shortening of tail-biting SC-LDPC codes. We find 
the minimum required rate-loss in order to achieve the same 
performance than terminated codes. We additionally study the 
use of tall-biting SC-LDPC codes for transmission over parallel 
channels (e.g., bit-interleaved-coded-modulation (BICM)) and 
investigate how the distribution of the coded bits between two 
parallel channels can change the performance of the code. We 
show that a tail-biting SC-LDPC code can be used with BP 
decoding almost anywhere within the achievable region of MAP 
decoding. The optimization comes with a mandatory buffer at 
the encoder side. We evaluate different distributions of coded 
bits in order to reduce this buffer length. 

1. Introduction 

Their excellent performance under low-complexity belief 
propagation (BP) decoding renders spatially coupled LDPC 
codes (SC-LDPC) attractive for error correction subsystems 
in upcoming communication systems such as, e.g., long- 
haul optical fiber transceivers ||T|. It was proven in Pj-Q 
that SC-LDPC codes can achieve the capacity of binary- 
input, memoryless, symmetric-output (BMS) channels under 
BP decoding. In particular, it was proven that the BP threshold 
of an SC-LDPC ensemble asymptotically converges to the 
threshold of the underlying LDPC ensemble under maximum- 
a-posteriori (MAP) decoding. This phenomenon is called 
threshold saturation. 

The idea of spatial coupling is to replicate L ^ 1 copies 
of an LDPC code along a spatial dimension and connecting 
the individual LDPC codes by swapping edges in a way that 
the local graphical structure of the original code is preserved. 
Terminating the replicas in an effective way triggers a “decod¬ 
ing wave” Q and threshold saturation occurs. Termination is 
usually performed by setting all code bits of a few LDPC 
copies at the boundaries of the spatial chain to zero Q, 
Q, reducing the total encoding rate. Although this rate-loss 
vanishes as L —>■ oo, it is not negligible for practical finite 
values of L. 

Using tail-biting SC-LDPC codes Q is a promising way 
to mitigate the rate-loss for a practical finite L. Threshold 
saturation occurs if there are enough reliably received code bits 
in just a few spatial positions. First, these reliable code bits are 
decoded and then, the decoder progresses successively along 


the spatial dimension. These reliable code bits can be obtained 
by shortening, as suggested (however without analysis) in ||^. 

If transmission takes place over different parallel channels, 
the required reliable code bits can be provided without short¬ 
ening (and rate-loss) by carefully interleaving the code bits 
among channels with different entropies. Examples of parallel 
channels include bit-interleaved coded-modulation (BICM) 
for high spectral efficiency modulation formats and multi¬ 
carrier transmission, such as orthogonal frequency division 
multiplexing (OFDM) or transmission over two polarizations, 
frequently occurring in optical communications. This approach 
has been first proposed in Gg, 0 where optimized bit 
interleavers for BICM have been numerically derived based 
on a Gaussian approximation of log-likelihood ratio (LLR) 
densities of channels and a P-EXIT approximation for 
density evolution. 

In this paper, we study both above approaches in a unified 
framework. We also investigate the effect of an additional 
practical constraint, the encoding buffer size required for 
interleaving. We use the technique of density evolution (DE) 
over a BEC to minimize the total amount of shortening. We 
show that the results almost immediately carry over to a binary 
additive white Gaussian noise (BAWGN) channel. In a second 
part, we consider the scenario of parallel channels. Using 
DE, we numerically find the achievable region of two BECs 
under BP decoding. As a more practical case, we revisit the 
practical application of BICM over the AWGN channel. In 
comparison to | |T0|, 01 ^ we apply a more precise numerical 
method for DE ]13| App. B] and no Gaussian approximation 
of channel densities. One noteworthy observation is that the 
tail-biting SC-LDPC code with an optimized bit-interleaving 
outperforms the terminated SC-LDPC codes with the same 
parameters and the same energy per bit. However, it is not clear 
if this generally applies for all tail-biting SC-LDPC codes. 
Finally, we show how the performance is degraded and how 
the optimal interleaving needs to be modified if accounting for 
a size-constrained encoding buffer. 

11. Tail-biting Spatially Coupled LDPC Codes 

To construct tail-biting SC-LDPC codes, we first lay out 
a set of spatial positions indexed by integers z S [0, L) on 
a circle, where L denotes the replication factor. We fix a 
“smoothing parameter” lu S N. To each position z, we assign 
n code bits and m parity check constraints (hence in total 
N = Ln code bits and M = Lm parity constraints). We 


assign a variable node to each code bit and assign a check 
node to each parity constraint. 

To construct a random instance of the tail-biting SC- 
LDPC(fi^, dc, ensemble, we connect the variable 

nodes and the check nodes in the following manner: Each 
variable node at position z is connected randomly to dy 
check nodes lying within the range (([z, z + w — l]))j^, where 
((x))i^ returns the remainder of the integer division of x by 
L (“x modulo L”). Equivalently, each check node at position 
z is connected randomly to variable nodes in the range 
{{[z — w + 1, z ]))Eor additional details, we refer the reader 
to 1^, p^ . If all code bits in positions z S [0, tc — 2] are 
fixed to be zero, the code ensemble becomes a (terminated) 
SC-LDPC ensemble Q. 

When n oo, it is common to use density evolution 
(DE) p3j to evaluate the asymptotic performance of the code 
ensemble under BP decoding. This method estimates how the 
empirical distribution of the LLRs of code bits at each position 
z evolves iteratively during BP decoding, given the empirical 
distribution of the received bits’ LLRs. The DE equations for 
SC-LDPC codes over a BMS channel are detailed in 0. Eor 
the case of BICM and BAWGN channels, we use the numerical 
DE method in | [T3] App. B]. Eor transmission over BECs, the 
LLR distribution can be represented by a scalar value, the 
erasure probability, and the DE equation turns into a scalar 
update recursion Q. In that case, the update equation becomes 



where Sz denotes the average erasure probability of code bits 
at spatial position z and denotes the average erasure 
probability of the outgoing messages from code bits in position 
z and at iteration t. We initialize = 1 for all z G [0, L). 
Eor details, see ©■ 


III. Optimal Shortening Over a BMS 


Consider the tail-biting SC-LDPC((i„, dc, L, w) ensemble. It 
is known that the BP threshold and the MAP threshold of this 
ensemble are equal to the BP threshold and the MAP threshold 
of the underlying LDPC(dt,, dc) ensemble 0. With a properly 
selected shortening, the BP threshold of the spatially coupled 
ensemble saturates to the MAP threshold. We shorten the code 
by setting some code bits to zero, as indicated in Pig. □ by 
red variable nodes. We thus inject some prior knowledge that 
can be used to trigger BP decoding. Let be the fraction 
of randomly shortened code bits (i.e., they are fixed to zero) 
for z G [0,L). Eor example, we recover the terminated SC- 
LDPC(ci^, dc, L, w) ensemble for = 1 for 0 < z < w — 2 
and az = 0 otherwise. This random shortening decreases the 
design rate of the code from R = 1 — dy/dc to 


R{a) = 1-^ 
dr 
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Fig. 1. Tanner graph of a tail-biting SC-LDPC= 2,dc = 4,L = 
12, ID = 2, n = 10) code. Red variable nodes show shortened code bits. 


where a = (aoi ■ • ■) cii-i)- For simplicity, we explain our 
optimization method for transmission over a BEC with erasure 
probability e. In this case, the average fraction of code bits 
erased by the channel at position z is = (1 — az)e. 
Equivalently, the code bits at position z ’’see” a BEC with 
erasure probability e^. Let Pe{a,T) denote the bit erasure 
probability of BP decoding after T iterations. 

Let Sgp and denote the BP threshold and the MAP 
threshold of the uncoupled LDPC(dt,, dc) ensemble. Eor any 
£ G (ebpi^map), our goal is to find a which maximizes R(a) 
under successful BP decoding, i.e.. 


«opt(£) = argmax{i?(a) I lim Pe(a, T) = 0}. (2) 

a T—>-oo 

Eor numerical computation, we relax this maximization: 

(i) Eor a given a, define Ts{a) as the smallest iteration t 

such that F \x^z~^'^ - < 6. 

(ii) We find 

g*{e) = argmax{i?(a) | Pe{a,Ts{a)) < d}. (3) 

oc 


We fix d = 10”^ throughout the paper. To find a*(e), 
we apply two sub-optimal algorithms: (i) Eor small L, an 
exhaustive search is carried out over the discretized space 
of a. Each az is quantized with resolution A = 10“^. (ii) 
The sub-optimal differential evolution algorithm GD is also 
used for large L which is generally much faster than an 
exhaustive search. We observe a good consistency between 
the results of both algorithms. Eor example, the optimized 
a* = (0.324, 0.149, 0.350, 0,..., 0) is obtained for the tail- 
biting SC-LDPC((it, = 3,dc = 6,L = 50, w = 3) ensemble 
used over a BEC with e = 0.48. We also consider a simpler 
scheme denoted “uniform shortening”, defined as 


auni(-B) = 



Q< z < B 

otherwise 


(4) 


where 0 < B < L. Similarly, we can find 


Au„i(^) = arg max {i?(a) | Pe{a,Ts{a)) < <5}. (5) 

B,auni(B) 

The above optimization is much simpler. For each B, it 
is a one-dimensional optimization over a bounded interval. 
The BP performance is monotonic in a, and thus, we can 









Fig. 2. The design rate R{a) of the tail-biting SC-LDPC(3, 6, 50, 3) 
ensemble shortened according to either (i) the optimized a* (e) (blue curve) 
or {ii) «*„;(£) (red curve). These curves are compared with the design rate 
of the terminated SC-LDPC(3, 6, 50, 3) ensemble (green curve). 


simply use algorithms such as the bisection method to find 
the best a„„i(i3). Then we change B to find Fig. 

illustrates the optimization results for the tail-biting SC- 
LDPC(3,6,50,3) ensemble. The maximum design rate is 
computed for both a*{e) and and for e S (eBP,eMAp)- 

We observe that both optimized shortenings reduce the rate- 
loss by more than 50%. Moreover, there is only a very small 
difference between the rate of a*(e) and a*„i(e). We observe 
the same behavior for tail-biting ensembles of LDPC(4, 8) 
and LDPC(5,10) codes. Note that the error probability of BP 
decoding is monotonically increasing in terms of e. Therefore, 
a*(£MAp) is also feasible for e < £map- This a*(£MAp) can be 
universally used for all £ G (£bp,£map) with more than 50% 
rate-loss gain, even though it is not the best a for e < £map- 

Fig. shows the average bit erasure probability Pe under BP 
decoding for the tail-biting SC-LDPC(3,6, 50,3) code short¬ 
ened by a*(£MAp)- For comparison, we also plot the erasure 
probability curve of the terminated SC-LDPC(3,6, 50,3) code, 
and the tail-biting code (without shortening). For all codes, 
n — 2000. We observe that the terminated SC-LDPC code 
has much smaller Pe than the tail-biting SC-LDPC code as 
the BP threshold of SC-LDPC code is increased to the MAP 
threshold. With shortening, not only the performance of the 
tail-biting SC-LDPC code improves to the performance of the 
terminated SC-LDPC code but also the shortened tail-biting 
SC-LDPC code has a larger rate than the SC-LDPC code. We 
also plot the performance of these codes over a binary additive 
white Gaussian noise (BAWGN) channel in Fig. The left 
sub-plot shows the bit error rate (BER) in terms of signal-to- 
noise (SNR) ratios Es/Nq. We observe the similar behavior as 
for BEC in Eig. In order to see the gain in coding rate, the 
same BER values are plotted in terms of E^/No = Eg/{RN q) 
in the right sub-plot. It shows that the larger rate leads to an 
additional net coding gain of « 0.1 dB. 

IV. Bit Interleaving Over Two Parallel Channels 

In this section, we consider the transmission over two inde¬ 
pendent parallel channels. We show that carefully interleaving 
code bits of tail-biting SC-LDPC code between channels leads 



Fig. 3. BP performance in terms of channel erasure probability e for a 
shortened tail-biting SC-LDPC code (blue curve), a terminated SC-LDPC 
code (red curve), and a tail-biting SC-LDPC code (green curve). For all codes, 
(du, dc,-L, Lf;, n) = (3,6,50,3,2000). 
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Fig. 4. BP performance in the BAWGN channel for a shortened tail-biting 
SC-LDPC code (blue curve), a terminated SC-LDPC code (red curve), and a 
tail-biting SC-LDPC code (green curve). For All codes, (c(„, dc, L, w, n) = 
(3, 6, 50,3, 2000). Left: BER over Es/Nq. Right: BER over Ei,/Nq. 

to significantly improved BP decoding performance without 
rate-loss. The results can be extended to more than two parallel 
channels in a straightforward manner. 

A. Parallel BECs 

We explain our optimization method using two independent 
parallel BECs with erasure probabilities £i and £2 and where 
both channels shall be used an equal number of times. In this 
case, the low complexity of DE analysis allows us to evaluate 
the complete achievable region of (£i,£ 2 ) under BP decoding 
and find the optimal bit interleaving. 

We assume that the transmitter knows the channel param¬ 
eters £1 and £2 and w.l.o.g. that £1 < £ 2 . The code bits 
are transmitted over either the first or the second channel. 
If exactly half of the code bits in each spatial position are 
passed through each channel, this is equivalent to transmitting 
over a BEC with erasure probability -I-£ 2 )- In this case, 
without termination, the tail-biting SC-LDPC ensemble cannot 
be decoded successfully unless + £ 2 ) < £bp- 

We show that we can exceed this bound if we carefully 
interleave the code bits of different spatial positions between 
channels. Let f3z denote the fraction of code bits in spa- 
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Fig. 5. Interleaving the code bits of different spatial positions between two 
parallel independent BECs such that for the average it holds fii < f-i 2 - 


tial position z transmitted over the EEC with ei. Clearly, 
Pz = c/2. The average erasure probability at position 
z m £z = PzSi + (1 — I3z)£2- Now, the problem of hnding 
optimal becomes similar to the best shortening optimization 
presented in Section III We already observed in Fig. that 
there is a very small difference between uniform shortening 
and non-uniform shortening. Due to its simplicity, we employ 
an approach similar to uniform shortening and consider the 
setup illustrated in Fig. 


, , I /3o) 0 < z < B 

—uni' ' I -2-^Po , > R 
f L-B ’ P ^ 


Ez = 


0< z <B 
z> B 


for some B < Lj^. It becomes now equivalent to the 
shortened code over a single EEC with erasure probability 
/i 2 with uniform shortening a = /ir/M 2 in Eq. 0 . It is easy 
to check p 2 < Jjjjow 

all the feasible 3-tuples (/ii,/i 2 , B), we can determine the 
achievable region (ei,e 2 )- The idea is as follows: if p,i < Ebp 
and B is large enough, the EP algorithm can decode some code 
bits lying in the range z < B. In particular, the “computation 
graph” of some code bits has enough non-erased variable 
nodes for the EP algorithm to recover them. Those recovered 
code bits then play the role of the effective termination for 
the coupled ensemble if /i 2 is small enough (i.e., ^2 < Emap)- 
Note that successful EP decoding is not attained by merely 
interleaving if either £2 > £1 > £bp or £1 + £2 > 2 e„Ap. 

For a given (/i 2 ,B), let /(/i 2 ,B) denote the largest pi for 
which EP decoding succeeds (see (|^). We have £i < Ebp and 


2 



^^2 < 



£1 + £2 


£r + £2 < B) -f 2 ( 1 



( 6 ) 


One can show that the DE equation is monotonically 
increasing in terms of e^. Therefore, if (£ 1 , 62 ) is achievable 
under EP decoding, then for all e/ < £1 and £2 < £ 2 , 
(£ 2 ,£ 2 ) is also achievable under EP decoding (with differ¬ 
ent (/ri,/r 2 ,B)). Using 0 , we can numerically calculate 
the maximal achievable region of (£i,£ 2 ) which can be 
decoded successfully in the limit of the code-length. We 
plot this achievable region in Fig. for the tail-biting SC- 
LDPC(3,6,L,3) ensemble with L = 25, 50 and 100. The 



Fig. 6. The achievable region under BP decoding for the tail-biting SC- 
LDPC(3, 6, L, 3) ensemble via suitable channel interleaving. The region is 
plotted for L = 25, 50 and 100. 


achievable region under MAP decoding is £1 -f £2 < 2£„ap- 
We observe that by suitable interleaving, the achievable region 
under EP decoding almost covers the entire MAP achievable 
region except for a small triangular region in which £1 > £bp 
and £2 > £bp. As L grows, the coverage saturates to the MAP 
achievable region (except for £1 > £bp and £2 > £bp). 

B. 4-ASK BICM over the AWGN Channel 

Eit-interleaved coded-modulation (EICM) scheme is a prac¬ 
tical pragmatic scheme to approach the constellation con¬ 
strained capacity of the AWGN channels with higher or¬ 
der modulation formats GD- In EICM over a real-valued 
channel, we map k (possibly interleaved) code bits to 2 ^ 
symbols of an Amplitude Shift-Keying (ASK) constellation 
{±1, ±3,..., ±(2^“^ -f 1)} using a Gray map. The LLRs of 
the transmitted k code bits are computed using a so-called 
bit-metric decoder: The LLRs are generally different for each 
code bit and some of the bits may be more reliable than others, 
depending on the mapping used. Interleaving is used to break 
the correlation between the different bits. 

For a tail-biting SC-LDPC code, this interleaving can be 
carried out separately in each spatial position. Assuming 
uncorrelated LLRs, we will have k parallel channels with 
different LLR distributions m- It has been shown in in), 
m that we can approach the EICM capacity using only a 
single (terminated) SC-LDPC code. A suitable interleaving 
scheme can significantly improve the EP performance of 
tail-biting SC-LDPC codes over such parallel channels with 
different reliability as occun'ing in EICM. More interestingly, 
such a code can outperform the terminated SC-LDPC code 
with the same design parameters and with the same energy 
per transmitted bits, as we will show below. Note that gen¬ 
erally speaking, we need to consider two distinct interleaving 
schemes, which are however usually combined in practice: 
One is required to remove the correlation for EICM, and the 




















other one, which we consider, enables threshold saturation. 

Here we focus on the 4—ASK modulation schem^ over 
an AWGN channel with SNR 77 = Es/Nq. BICM yields 
two binary parallel channels, namely 01 ( 77 ) and 02 ( 77 ), with 
different LLR distributions. To use DE, the channels must be 
symmetric, which can be ensured by “channel adaptation” as 
detailed in ( 11 - 

Recall that Pz is the fraction of code bits in spatial position 
z transmitted over 01 ( 77 ) (with constraint Pz = L/2). 

Let PeiP, T) denote the error probability of BP decoding after 
T iterations. We define 


^opt = argmin{77 | lim Pe{f3, T) = 0}. (7) 

—“P’- /3 T—>00 — 


Similar as in Section III we relax the optimization problem 
by introducing a new condition Pe{f3,T) < 10“® for T < 
1000 iterations. We consider two distinct setups, first uniform 
interleaving with 


^ ■{b) = {7L 


BPo 


L-B ’ 


0 < z < i? 
z> B 


and second non-uniform interleaving with (3 = 

(/3o,/3i, /3 l-i) and find the minimum 77 for both 

setups. For uniform interleaving, we select the minimum 
77 over all 0 < i? < L. For non-uniform interleaving, we 
use the technique of differential evolution to find the best 
sub-optimal /?*, similarly to fTol. To evaluate Pe{/3,T), we 
numerically compute the LLR dls^tributions of 01 ( 77 ) and 02 ( 77 ) 
for each 77 and then use quantized DE analysis (uniformly 
quantizing the LLR range [—20, 20] with 1000 levels). This 
method is more accurate than the P-EXIT method and 
Gaussian approximation of channel’s LLR distributions used 
in pO) , flT) and may potentially lead to improved results, 
however, with a significantly increased (offline optimization) 
complexity. 

As an example, consider the tail-biting SC-LDPC(3,6,50,3) 
ensemble. Without interleaving (/3^ = i), the BP threshold 
is 77 = 3.399 dB while the BP threshold of the terminated 
ensemble is 77 = 2.658 dB. By a uniform interleaving, the BP 
threshold of tail-biting ensemble improves to 77 = 2.809 dB 
and it is even slightly more improved to 77 = 2.804 dB by the 
best non-uniform interleaving we have found. 

Since the terminated ensemble has a smaller rate, it is more 
fair to compare the BP thresholds in terms of Eb/No. The 
rate of tail-biting ensemble is 1/2 and thus, E^/Nq = Es/Nq. 
However, the rate of the terminated ensemble is 0.481 and 
thus its BP threshold is E^/Nq = 2.826 dB, larger than the 
BP threshold of tail-biting ensemble with optimal interleav¬ 
ing. This superior performance can be also seen in Fig. [^ 
comparing the averaged BP performance of actual codes with 
finite n = 2000. In terms of E^/Nq, we observe a net coding 
gain of « 0.04 dB in this simulation. 


*4-ASK modulation represents one quadrature component in a 16-QAM 
modulation, the results discussed here therefore directly apply to 16-QAM. 



Fig. 7. BP performance of the 4-ASK BICM scheme over the AWGN channel 
using an optimized tail-biting SC-LDPC code (blue curve), a terminated SC- 
LDPC code (red curve), and a tail-biting SC-LDPC code (green curve). For 
all codes, (d„, dc, L, w, n) = (3, 6, 50, 3, 2000). Left: BER values in terms 
of Es/Nq. Right: BER values in terms of Ei,/Nq. 


V. Encoding Queue For Bit Interleaving 

The convolutional structure of SC-LDPC codes (including 
tail-biting codes) enables sequential encoding of information 
bits as they are produced with a structure akin to shift registers. 
In particular, the code bits in spatial position z are generated 
from the code bits in position [z — w, z] 0 On the other hand, 
the windowed BP decoder enables sequential decoding of 
code bits as they arrive at the receiver. These features make 
SC-LDPC codes particularly attractive for high-speed, latency 
sensitive applications (l). 

If we assume that the parallel channels should be used 
an equal amount of time, i.e., the gross transmission rate of 
the channels is identical, code-bit interleaving between spatial 
positions violates the equal fraction of code bits targeted for 
different channels. For example, in BICM, we always require 
an equal amount of all channels to be used in every frame 
to generate the modulation symbols. Therefore, some code bit 
must be stored in a buffer and transmitted later. This drawback 
causes queuing of code bits in the encoder and decoder and 
ultimately leads to an increase in transmission latency]^ 

In this section, we consider the size of this encoding 
queue as an additional constraint when optimizing the channel 
interleaving. For a given pair of channels, there are usually 
many j3 leading to practically the same “quasi-optimal” BP 
performance. Our goal is to And the one that leads to the 
encoding queue of minimum size. 

For simplicity, we focus here on two parallel BECs but 
the results can be carried immediately over to other channel 
models, such as BICM as shown above. We assume that the 
code bits are generated sequentially from position z = 0 to 
z = L — 1 with a constant speed. We assume that the encoder 
processed each spatial position as a block. Recalling that /3z 

^The range of required information bits may change for different encoders. 
It is impoifant that its size is 0{w) <g L. 

^Note that we do not include the additional buffering required for tail-biting 
in our considerations here, as this amount stays constant irrespective of the 
interleaving. 
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Fig. 8. The encoding queue length Q(/3) of a tail-biting SC-LDPC(3,6,50,3) 
ensemble over BECs (ei ,£2 = 0.55) and ei S (0.3,0.39). 


is the fraction of code bits in position x to be transmitted over 
the first channel, then it is clear that only n ■ min(/3^, 1 — /3z) 
bits can be forwarded to the parallel channels to be mapped 
onto modulation symbols and the remaining n|l — 2f3z\ bits 
need to be stored in the queue, waiting further processing. 
Let Q{z) = ~ 1) denote the length (in bits) of 

the encoding queue after encoding spatial positions up to z, 
where Q{z) > 0 denotes an excess of first channel bits and 
Q{z) < 0 an excess of second channel bits. The total required 
memory size for the queue is Q{13) = max^ |( 3 ( 2 :)| which also 
shows the extra encoding latency. 

Consider now a pair of BECs (£i,£ 2 ) in the achievable 
region of Fig. Our objective is to minimize the queue, i.e., 

^min = nun{Q{/3)| ^1^ Pe(/3, T) = 0}. (8) 

We relax the above optimization similarly as in ([^ and we 
consider again both the uniform interleaving /3^.^(i3), and 
the non-uniform interleaving in which we use the differential 
evolution to find B* . with the smallest queue. For uniform 
interleaving, we choose the minimum queue B . (B) 

—inin.uni' ' 

over all B. The optimization results for the tail-biting SC- 
LDPC(3,6,50,3) ensemble are depicted in Fig. For the 
sake of a clear presentation, we fix £2 = 0.55 and find 
the minimum queue size for 0.3 < £1 < 0.39. As we 
can see, the required queuing memory can be significantly 
smaller using a non-uniform interleaving. As discussed in 
Section |IV-A[ the optimal non-uniform interleaving improves 
the BP performance only slightly in comparison to the optimal 
uniform interleaving. However, it can have many practical 
advantages such as large memory savings for realizing the 
encoding queue. Fig. [9] shows the non-uniform interleaving 
found for (£1 = 0.375, £2 = 0.55). It is interesting that as 
a result of such interleaving, the average erasure probability 
Ez = /3 z£i + (1 — /3z)£2 > ^MAP in some spatial positions. 

VI. Conclusion 

Rate-loss mitigation of spatially coupled codes is one of the 
major challenges towards a practical implementation of this 
class of codes. We have shown that a significant rate-loss re¬ 
duction can be obtained from shortening tail-biting SC-LDPC 
codes. For the case of SC-LDPC(d.u = 3, dc = 6 , L, tu = 3) 
codes, we can reduce the rate-loss by more than 50% by 




LDPC(3, 6, 50, 3) ensemble over two BECs (ei = 0.375, £2 = 0.55) and 
with the optimized non-uniform interleaving 0* . . Each curve represents 
after t = 1,40, 80,... iterations. 


a suitable shortening pattern. The shortened tail-biting SC- 
LDPC codes can outperform terminated SC-LDPC codes for 
transmission over AWGN channels. 

Rate-loss mitigation can be fulfilled effectively when the 
transmission takes place over more than one binary-input 
channel, as is the case in BICM. The extra available channel 
dimensions can be exploited by properly interleaving code bits 
in different spatial positions among channels. For the case 
of two parallel BECs, we show that a tail-biting SC-LDPC 
code under BP decoding can be operated almost anywhere 
within the achievable region of MAP decoding. Optimized bit 
mappings for higher order modulation can help to increase 
the spectral efficiency of the code. However, the price is a 
mandatory buffer at the encoder. This effect can be mitigated 
by non-uniform channel allocation patterns, which show a 
better performance than the uniform schemes. 
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